A Restless Bandit Model for Resource Allocation, Competition, and Reservation
نویسندگان
چکیده
In “A Restless Bandit Model for Resource Allocation, Competition and Reservation,” J. Fu, B. Moran, P. G. Taylor study a resource allocation problem with varying requests resources of limited capacity shared by multiple requests. This is modeled as set heterogeneous restless multi-armed bandit problems (RMABPs) connected constraints imposed capacity. Following Whittle’s idea relaxing the Weber Weiss’s proof asymptotic optimality, authors propose an index policy establish conditions it to be asymptotically optimal in regime where both arrival rates capacities increase. particular, they provide simple sufficient condition optimality and, complete generality, method that generates candidate policies which can checked. Via numerical experiments, demonstrate effectiveness these results even pre-limit case.
منابع مشابه
Optimal resource allocation for time-reservation systems
This paper studies the optimal resource allocation in time-reservation systems. Customers arrive at a service facility and receive service in two steps; in the first step information is gathered from the customer, which is then sent to a pool of computing resources, and in the second step the information is processed after which the customer leaves the system. A central decision maker has to de...
متن کاملDynamic priority allocation via restless bandit marginal productivity indices
This paper surveys recent work by the author on the theoretical and algorithmic aspects of restless bandit indexation as well as on its application to a variety of problems involving the dynamic allocation of priority to multiple stochastic projects. The main aim is to present ideas and methods in an accessible form that can be of use to researchers addressing problems of such a kind. Besides b...
متن کاملOptimal Resource Allocation with Semi-Bandit Feedback
We study a sequential resource allocation problem involving a fixed number of recurring jobs. At each time-step the manager should distribute available resources among the jobs in order to maximise the expected number of completed jobs. Allocating more resources to a given job increases the probability that it completes, but with a cut-off. Specifically, we assume a linear model where the proba...
متن کاملA Better Resource Allocation Algorithm with Semi-Bandit Feedback
We study a sequential resource allocation problem between a fixed number of arms. On each iteration the algorithm distributes a resource among the arms in order to maximize the expected success rate. Allocating more of the resource to a given arm increases the probability that it succeeds, yet with a cutoff. We follow Lattimore et al. (2014) and assume that the probability increases linearly un...
متن کاملa benchmarking approach to optimal asset allocation for insurers and pension funds
uncertainty in the financial market will be driven by underlying brownian motions, while the assets are assumed to be general stochastic processes adapted to the filtration of the brownian motions. the goal of this study is to calculate the accumulated wealth in order to optimize the expected terminal value using a suitable utility function. this thesis introduced the lim-wong’s benchmark fun...
15 صفحه اولذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Operations Research
سال: 2022
ISSN: ['1526-5463', '0030-364X']
DOI: https://doi.org/10.1287/opre.2020.2066